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METHODS OF INCREASING PROTEIN EXPRESSION LEVELS 

Background of the Invention 

1 . Field of the Invention 

The present invention relates to methods of increasing protein expression 
levels whereby at least one amino acid in a polypeptide amino acid sequence 
is substituted for the amino acid, proline. 

2. Background 

The ability to obtain a high level of expression of secreted recombinant 
proteins in yeast and mammalian cells is often found to be protein dependent. 
Efforts in maximizing recombinant protein expression are often focused on 
increasing the levels of the mRNA of the recombinant gene. However, the 
rate-limiting step in the expression of certain proteins is not the level of 
mRNA but rather is due to inefficiencies in the folding, addition of post- 
radiational modifications and secretion of the recombinant proteins. 

Combining random mutagenesis with the display of proteins on the 
surface of yeast is a powerful technique to identify proteins with altered 
properties. In this technique, mutant proteins are displayed on the cell 
surface and screened for either higher-affinity binding or increased 
expression levels of the displayed protein by FACS (flouresence activated 
cell sorting). Display of mutant libraries on the yeast cell surface has been 
used to identify mutants of both single-chain antibodies and single-chain T- 
cell receptors that have a higher affinity for antigen or peptide/MHC, 
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respectively . The yeast surface display method has also been used to 
improve the level of expression of a single-chain T-cell receptor. 

There remains a need in the art for identifying stuctural characteristics 
4 or features of proteins which permit high expression of recombinant 
proteins, particularly eukaryotic proteins, whose expression may be limited 
due to inefficiences in folding, addition of post-translational modifications 
and/or secretion of proteins. 

8 

Summary of the Invention 

The present invention relates to methods of increasing protein 
12 expression levels whereby at least one amino acid in a protein amino acid 
sequence is substituted for the amino acid, proline. Preferably, the 
substitution occurs within 15 amino acids, more preferably within 10 amino 
acids and most preferably within 5 amino acids of a cysteine amino acid 
16 residue. The present invention not only includes methods for polypeptides 
with proline substitutions, but also polynucleotides with codon substitutions 
for which a codon for any amino acid, except proline, is substituted for a 
codon encoding for proline. Specifically, the codons encoding proline are 
20 CCU, CCC, CCA and CCG for RNA and CCT, CCC, CCA, and CCG for 
DNA. 

Any protein of interest can be included in the invention, provided that 
24 it can be expressed from a recombinant DNA molecule in a suitable host cell 
and remain functionally active. Preferably, the protein is one whose 
expression is limited by constraints on conformational folding. More 
preferably, the protein will contain one or more cysteine amino acid residues 
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and may require the formation of correct disulfide bonds, resulting in proper 
conformational folding of the tertiary structure of the polypeptide. 



4 The present invention solves, in part, the continuing need in the art for 

means to improve the efficiency of expression of recombinant proteins in 
mammalian and other host cells. The invention provides methods that 
overcome the problem of inefficient folding of polypepetides, which limits 

8 the expression of certain heterologous polypeptides , and provides for higher 
levels of expression of such polypeptides. 



Brief Description of the Figures 

12 Figure 1A represents a diagramatic representation of a flow cytometric 
analysis of yeast cells stained with a polyclonal antibody directed against 
TNFrED and R-phycoerthrin-conjugated donkey anti-goat IgG. Yeast cells 
contained either the pYES2 plasmid (left panel) or the pYES2-TNFrED-Agg 

16 plasmid (right panel). 

Figure IB is a diagramatic representation of a flow cytometric analysis of 
yeast cells stained with biotinylated hTNF-a and FITC-labeled avidin. 
20 Yeast contained either the pYES2 plasmid (left panel) or the pYES2- 
TNFrED-Agg plasmid (right panel). 

Figure 1C is a diagramatic representation of a two-color flow cytometric 
24 analysis of yeast cells that have been stained with biotinylated hTNF-a and 
FITC-labeled avidin and a polyclonal antibody directed against TNFrED and 
R-phycoerthrin-conjugated donkey anti-goat IgG. Yeast were transformed 
with a mutant library (left panel) or with the same library after three rounds 
28 of sorting (right panel). 

Figure ID is a diagramatic representation of a flow cytometric analysis of 
yeast cells stained with biotinylated hTNF-a and FITC-labeled avidin. In 
32 the left panel the shaded histogram represents the yeast expressing wild-type 
TNFrED whereas the unshaded histogram represents clone 11. In the ri^ht 
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panel the shaded histogram represents the yeast expressing wild-type 
TNFrED whereas the unshaded histogram represents clone 6. 

4 Figure 2 is a diagramatic representation of a saturation binding analysis of 
yeast expressing wild-type TNFrED (Fig 2A), mutant clone 6 (Fig 2B)or 1 1 
(Fig 2C). Triplicate samples of yeast were incubated with increasing 
concentrations of [ l25 I]TNF~a. Binding experiments were repeated two- 

8 three times with similar results. 

Figure 3 is a diagramatic representation of the <(>-i|/ plot of the substituted 
residues in the mutant clones 6 and 1 1 and the corresponding residues found 

12 in wild-type TNFrED wherein the shaded regions are the favorable 

conformations obtained from an analysis of 136 non-homologous protein 
structures at a resolution of 1 .8 A or higher. The darker the color, the higher 
the frequency of the <|>-\j/ angles of the residues, and hence the more favored 

16 conformation. The <|>-\|/ angles of the corresponding residue in the TNFrED 
crystal structures are shown in red squares (lncf.pdb, TNFrED dimer 
structure, 1.85 A resolution), red diamonds (lext.pdb, TNFrED dimer 
structure, 2.85 A resolution), and red triangles (ltnr.pdb, TNF/TNFrED 

20 complex structure, 2.85 A resolution). The representative high-resolution 
dataset was used to generate the green symbols and these symbols represent 
the <j)-vj/ angles of the substituted residue in the same tri- (green 'x's) or tetra- 
(green circles) amino acid context as that found in wild-type TNFrED or the 

24 mutant clones (see also Table 4). (A). Ser87. (B) Pro87. (C). His34. (D) 
Pro34. (E). Ser57. (F). Ilc57. 

Figure 4 presents the amino acid and polynucleotide sequences of TNFrED. 



Detailed Description of the Invention 

32 The present invention is based on the unexpected finding that the 

substitution of one or more amino acids for the amino acid proline in the 
amino acid sequence of a polypeptide results in higher levels of expression 
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of that protein. While not wishing to be bound by theory, evidence supports 

« 

the suggestion that the biochemical mechanism allowing higher levels of 
expression relates to the promotion of proper conformational folding of a 
4 protein. 

Through screening of mutant clones of the extracellular domain of the 
receptor for Tissue Necrosis Factor (TNFrED) with the yeast display system 

8 we were able to identify mutations of TNFrED that conferred a higher 
expression level in yeast when compared to wild-type TNFrED. One clone 
contained the S87P mutation and the other clone contained the H34P and 
S57I mutations. These mutations did not change the affinity of TNFrED for 

12 TNF-d. Expression of these mutants in mammalian cells generated similar 
findings, strongly suggesting that the mechanism through which the 
mutations increase expression is conserved between yeast and mammalian 
cells. Examination of the mutations individually or in combination revealed 

16 that either proline change increased the expression of TNFrED whereas the 
S57I mutation had no effect on expression. 

In each mutant clone, we showed that the residue responsible for the 
20 higher protein expression levels is a proline substitution next to a cysteine 
involved in a disulfide bond. In proline the nitrogen atom is part of a rigid 
ring and no rotation of the ring N-C bond is possible. Thus the choices of 
\\f angles are fewer in the proline substitutions than the histidine or serine 
24 residues found in the wild-type TNFrED (shaded areas in Fig. 3). Moreover, 
the <|)-v|/ angles of the histidine or serine residue found in the crystal 
structures of TNFrED are located in the more favored regions of the <j>-vj/ plot 
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of a proline residue. These results indicate that proline is the preferred 
residue at position 34 or 87 in terms of the main-chain conformation. This 
phenomenon is more evident in the uncomplexed structure of TNFrED with 

4 the highest resolution (pdb code: lext; see the red filled squares in Figure 3). 
Therefore, the mutations have no advantage in the complexed form 
compared to the uncomplexed form, which is in agreement with our 
observation that the mutants do not effect the affinity of TNFrED for ligand. 

8 To test if there is any change of the favorable status of a proline for the given 
(j>-V|/ angles when the adjacent one or two amino acids are also considered, 
we searched the representative PDB dataset for the same sequences in the 
mutated form and the wild-type form. There are more cases of mutated 

12 sequences than the wild-type sequences (see Table 3 and also the green Vs 
in Figure 3). These results further emphasize that proline is the preferred 
residue at position 34 or 87, both in terms of the main-chain conformation 
for that particular residue and also for when the surrounding residues and 

16 structure environment are taken into consideration. The S57I mutation is a 
different situation. It results in a narrower region of <J>-v|/ angles but deviates 
away from the preferred (j>-\|/ angle region to a slightly less preferred region 
(Fig. 3). Thus, there appears to be no advantage of the S57I mutation in 

20 term of <J>-v(/ . preference at that position. This is consistent with our 
experimental observation that the presence of the S57I mutation alone does 
not significantly alter the expression level of TNFrED. Taking all the data 
together, we conclude that introduction of the proline residues assists the 

24 local sequence of each mutant to adopt the conformations seen in crystal 
structures of TNFrED thereby fixing neighboring cysteine residues into the 
correct orientation. We proposed that proper orientation of the cysteine in 
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turn facilitates formation of the correct disulfide bond and results in a higher 
yield of correctly folded molecules. Our finding that these proline 
substitutions increase the expression level of TNFrED is consistent with this 
4 proposal. Furthermore, proline substitutions can be extended to other 
proteins where the formation of disulfide bonds is thought to be a limiting 
factor in expression. 

8 

The present invention relates to methods of increasing protein 
expression levels whereby at least one amino acid in a protein amino acid 
sequence is substituted for the amino acid, proline. Preferably, the 

12 substitution in these proline-substituted polypeptides occurs within 15 amino 
acids, more preferably within 10 amino acids and most preferably within 5 
amino acids of a cysteine amino acid residue. The present invention not only 
includes methods for polypeptides with proline substitutions, but also 

16 polynucleotides with codon substitutions for which a codon for any amino 
acid, except proline, is substituted for a codon encoding for proline. 
Specifically, the codons encoding proline are CCU, CCC, CCA, CCG for 
RNA and CCT, CCC, CCA, CCG for DNA (see Table 1). 

20 

I. Definitions: 

24 

In general, the following words or phrases have the indicated definition 
when used in the description, examples and claims. 

28 "Protein" shall mean any polypeptide comprised of amino acids and having 
a unique amino acid sequence. The term "protein" may be used 
interchangeably herein with the term "polypeptide". 
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"Substitution" shall mean the introduction of an amino acid, either by 
replacement of an existing amino acid residue or by insertion of an 
4 additional residue. In the present invention, the replacement or inserted 
amino acid is proline. 

8 "Increased Expression" shall mean higher or greater expression levels of 
protein when comparing the expression levels of a protein with its original 
amino acid sequence with the expression levels of the same protein with one 
or more proline substitutions. 

12 

"Amino Acid" shall mean an amino acid residue contained in the group 
consisting of the 20 naturally occurring amino acids. In the present 

16 application, amino acid names are used as defined by the Protein DataBank 
(PDB) ( www.odb.orai which is based on the IUPAC nomenclature (IUPAC 
Nomenclature and Symbolism for Amino Acids and Peptides (residue 
names), Eur. J. Biochem., 138, 9-37 (1984) together with their corrections in 

20 Eur. J. Biochem. , 152, 1 (1985); i.e. alanine (Ala or A), cysteine (Cys or C), 
aspartic acid (Asp or D), glutamic acid (GLU or E), phenylalanine (Phe or 
F), glycine (Gly or G), histidine (His or H), isoleucine (He or I), lysine (Lys 
or K), leucine (Leu or L), methionine (Met or M), asparagine (Asn or S), 

24 proline (Pro or P), glutamine (Gin or Q), arginine (Arg or R), serine (Ser or 
S), threonine (Thr or T), valine (Val or V), tryptophan (Trp or W), and 
tyrosine (Tyr or Y) residues. 

28 "Positioned Within" shall mean being positioned proximal to a particular 
reference amino acid in an amino acid sequence, in either the 3' or 5' 
direction. 

32 "Codon" shall mean a triplet of nucleotides coding for a single amino acid, 
(see Table 1) 

"Polynucleotide" shall mean a single stranded molecule of DNA or RNA. 

36 

The terminology used for identifying amino acid positions / 
substitutions is illustrated as follows: H34P indicates that position number 
34 in the linear sequence of amino acids for a particular protein as shown in 
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Figure 4 and SEQ ID NO: 1 is occupied by a histidine residue and that it has 
been substituted with a proline residue. Unless otherwise indicated, the 
numbering of amino acid residues made herein is made relative to the amino 

4 acid sequence shown in Figure 4 and SEQ ID NO: 1 (for Tissue Necrosis 
Factor). Multiple substitutions are indicated with a e.g., H34P + S57I 
means an amino acid sequence which comprises substitution of the histidine 
residue in position 34 by a proline residue and substitution of the serine 

8 residue in position 57 by a isoleucine residue. 



Table 1 

12 
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Methods for Generating an Expression System for Proline Substituted 
Polypeptides: 

4 The proline substituted polypeptides of the present invention may be 

produced by any suitable method known in the art. These methods include 
the construction of nucleotide sequences encoding the respective proline 
substituted polypeptides and expressing the amino acid sequence in a 

8 suitable transfected host. Proline substituted polypeptides of the present 
invention may also be produced by chemical synthesis or by a combination 
of chemical synthesis and recombinant DNA technology. 

Proline substituted polypeptides of the invention may be constructed 

12 by isolating or synthesizing a nucleotide sequence encoding the parent. The 
nucleotide sequence is then changed so as to effect the substitution of the 
relevant amino acid residue(s). The nucleotide sequence can be modified by 
site directed mutagenesis as in the Examples of the present specification. In 

16 the alternative, the nucleotide sequence may be prepared by chemical 
synthesis, wherein oligonucleotides are designed based on the specific 
amino acid sequence of the proline susbstituted polypeptides. 

The nucleotide sequence encoding the proline susbstituted polypeptide 

20 is inserted into a recombinant vector and operably linked to control 
sequences necessary for expression of the polypeptide in the desired 
transfected host cell. One of skill in the art may make a selection among 
these vectors, expression control sequences and hosts without undue 

24 experimentation. 

The recombinant vector may be an autonomously replicating vector, 
i.e. a vector which exists as an extrachromosomal entity, the replication of 
which is independent of chromosomal replication, e.g. a plasmid. 

* 
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Alternatively, the vector is one which, when introduced into a host cell, is 

■ 

integrated into the host cell genome and replicated together with the 

chromosome(s) into which it has been integrated. 
4 The vector is preferably an expression vector in which the nucleotide 

sequence encoding the polypeptide of the invention is operably linked to 

additional segments required for transcription of the nucleotide sequence. 

The vector is typically derived from plasmid or viral DNA. A number of 
8 suitable expression vectors for expression in the host cells mentioned herein 

are commercially available or described in the literature 

The recombinant vector may further comprise a DNA sequence 

enabling the vector to replicate in the host cell in question. An example of 
12 such a sequence, (when the host cell is a mammalian cell) is the SV40 origin 

of replication. 

The vector may also comprise a selectable marker, e.g. a gene whose 
product complements a defect in the host cell, such as the gene coding for 
16 dihydrofolate reductase (DHFR) or one which confers resistance to a drug, 
e.g. ampicillin, kanamycin, tetracycline chloramphenicol, neomycin, 
hygromycin or methotrexate. 

The vector may also comprise an amplifiable gene, such as DHFR, such 
20 that cells having multiple copies of the amplifiable gene and flanking 
sequences, including the proline substituted polypeptide DNA, can be 
selected for on appropriate media. 

The term "control sequences" is defined herein to include all 
24 components which are necessary or advantageous for the expression of the 
polypeptide of the invention. Examples of suitable control sequences for 
directing transcription in mammalian cells include the early and late 
promoters of SV40 and adenovirus, e.g. the adenovirus 2 major late 
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promoter, the MT-1 (metallothionein gene) promoter and the human 
cytomegalovirus immediate-early gene promoter (CMV). 

The nucleotide sequences of the invention encoding the proline 
substituted polypeptides, whether prepared by site-directed mutagenesis, 
synthesis, PCR or other methods, may optionally also include a nucleotide 
sequence that encodes a signal peptide. The signal peptide is present when 
the polypeptide is to be secreted from the cells in which it is expressed. 
Such signal peptide, if present, should be one recognized by the cell chosen 

■ 

for expression of the polypeptide. The signal peptide may be homologous 
(e.g. be that normally associated with the parent polypeptide) or 
heterologous (i.e. originating from another source than parent polypeptide) 
to the polypeptide or may be homologous or heterologous to the host cell, 
i.e. be a signal peptide normally expressed from the host cell or one which is 
not normally expressed from the host cell. 

Any suitable host may be used to produce the proline susbstituted 
polypeptide of the invention, including bacteria, fungi (including yeasts), 
plant, insect, mammal, or other appropriate animal cells or cell lines, as well 
as transgenic animals or plants. Examples of suitable mammalian host cells 
include Chinese hamster ovary (CHO) cell lines, (e.g. CHO-KL; ATCC 
CCL-61), Green Monkey cell lines (COS) (e.g. COS 1 (ATCC CRL-1650), 
COS 7 (ATCC CRL-1651)); mouse cells (e.g. NSIO), Baby Hamster Kidney 
(BI-EK) cell lines (e.g. ATCC CRL-1632 or ATCC CCL-10), and human 
cells (e.g. BEK 293 (ATCC CRL-1573)), as well as plant cells in tissue 
culture. Additional suitable cell lines are known in the art and available 
from public depositories such as the American Type Culture Collection, 
USA. Methods for introducing exogeneous DNA into mammalian host cells 
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include calcium phosphate-mediated transfection, electroporation, DEAE- 
dextran mediated transfection, liposome-mediated transfection and viral 
vectors. 

■ 

Cells are cultivated in a nutrient medium suitable for production of the 
proline substituted polypeptide using methods known in the art. For 
example, the cell may be cultivated by shake flask cultivation, small-scale or 
large-scale fermentation (including continuous, batch, fed-batch, or solid 
state fermentations) in laboratory or industrial fermenters performed in a 
suitable medium and under conditions allowing the proline susbstituted 
polypeptide to be expressed and/or isolated. The cultivation takes place in a 
suitable nutrient medium comprising carbon and nitrogen sources and 
inorganic salts, using procedures known in the art. Suitable media are 
available from commercial suppliers or may be prepared according to 
published compositions (e.g. in catalogues of the American Type Culture 
Collection). If the proline susbstituted polypeptide is secreted into the 
nutrient medium, it can be recovered directly from the medium. If the 
proline susbstituted polypeptide is not secreted, it can be recovered from cell 
lysates. 

The resulting proline substituted polypeptide may be recovered by 
methods known in the art. For example, it may be recovered from the 
nutrient medium by conventional procedures including, but not limited to, 
centrifugation, filtration, extraction, spray drying, evaporation, or 
precipitation. 

The proline substituted polypeptides may be purified by a variety of 
procedures known in the art including, but not limited to, chromatography 
(e.g. ion exchange, affinity, hydrophobic, chromatofocusing, and size 
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exclusion), electrophoretic procedures (e.g. preparative isoelectric focusing), 
differential solubility (e.g. ammonium sulfate precipitation), SDS-PAGE, or 
extraction, 

* 

Pharmaceutical composition of the invention and its use 

In one aspect, the proline substituted polypeptide or the 
pharmaceutical composition according to the invention is used for the 
manufacture of a medicament for treatment of diseases, disorders or 
conditions. 

In another aspect the proline susbstituted polypeptide or the 
pharmaceutical composition according to the invention is used in a method 
of treating a mammal, in particular a human, comprising administering to the 
mammal in need thereof such proline susbstituted polypeptide or 
pharmaceutical composition thereof. 

It will be apparent to those of skill in the art that an effective 
amount of a conjugate, preparation or composition of the invention depends, 
inter alia, upon the disease, the dose, the administration schedule, whether 
the polypeptide or conjugate or composition is administered alone or in 
conjunction with other therapeutic agents, the serum half-life of the 
compositions, and the general health of the patient. Typically, an effective 
dose of the preparation or composition of the invention is sufficient to 
ensure a therapeutic effect. 

The proline susbstituted polypeptide produced by the methods of the 
invention is normally administered in a composition including one or more 
pharmaceutical^ acceptable carriers or excipients. "Pharmaceutically 
acceptable" means a carrier or excipient that does not cause any untoward 
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■ 

effects in patients to whom it is administered. Such pharmaceutical^ 
acceptable carriers and excipients are well known in the art, and the 
polypeptide or conjugate of the invention can be formulated into 
pharmaceutical compositions by well-known methods (see e.g. Remington's 
Pharmaceutical Sciences, 18th edition, A. R. Gennaro, Ed., Mack Publishing 
Company (1990); Pharmaceutical Formulation Development of Peptides and 
Proteins, S. Frokjaer and L. Hovgaard, Eds., Taylor & Francis (2000); and 
Handbook of Pharmaceutical Excipients, 3rd edition, A. Kibbe, Ed., 
Pharmaceutical Press (2000)). Pharmaceutically acceptable excipients that 
may be used in compositions comprising the polypeptide or conjugate of the 
invention include, for example, buffering agents, stabilizing agents, 
preservatives, isotonifiers, nonionic surfactants or detergents ('Vetting 
agents"), antioxidants, bulking agents or fillers, chelating agents and 
cosol vents. 

The pharmaceutical composition of the proline susbstituted 
polypeptide of the invention may be formulated in a variety of forms, 
including liquids, e.g. ready-to-use solutions or suspensions, gels, 
lyophilized, or any other suitable form, e.g. powder or crystals suitable for 
preparing a solution. The preferred form will depend upon the particular 
indication being treated and will be apparent to one of skill in the art. 

. The pharmaceutical composition containing the proline susbstituted 
polypeptide of the invention may be administered intravenously, 
intramuscularly, intraperitoneally, intradermally, subcutaneously, 
sublingualy, buccally, intranasally, transdermally, by inhalation, or in any 
other acceptable manner, e.g. using PowderJect@ or ProLease(D technology 
or a pen injection system. The preferred mode of administration will depend 

■ 

upon the particular indication being treated and will be apparent to one of 
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skill in the art. In particular, it is advantageous that the composition be 
administered subcutaneously, since this allows the patient to conduct the 
self-administration. 

The pharmaceutical composition of the invention may be 
administered in con-junction with other therapeutic agents. These agents 
may be incorporated as part of the same pharmaceutical composition or may 
be administered separately from the polypeptide or conjugate of the 
invention, either concurrently or in accordance with any other acceptable 
treatment schedule. In addition, the polypeptide, conjugate or 
pharmaceutical composition of the invention may be used as an adjunct to 
other therapies. 



Exemplification: Enhanced Expression of Tissue Necrosis Factor (TNF) 

The extracellular domain of the p55 TNF receptor (TNFrED) was 
randomly mutated and libraries of TNFrED mutants were displayed on the 
surface of yeast cells. Two mutant TNFrED clones were identified by 
fluorescence-activated cell sorting (FACS) that expressed two- to five-fold 
higher in yeast compared to wild-type TNFrED. In one mutant clone there 
was a Ser to Pro change at position 87 and in the other mutant clone there 
was a His to Pro change at position 34 and a Ser to He change at position 57. 
The presence of either the S87P or H34P mutation resulted in higher 
expression levels of TNFrED in HEK293-EBNA cells whereas the S57I 
mutation had no effect on expression. These substituted residues did not 
have an effect on the affinity for TNF-a. Examination and analysis of the 
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substituted residues in the crystal structures of TNFrED indicates that the 
introduction of proline residues likely assists the local sequence of the 
mutants to adopt favorable conformations that fix the neighboring cysteine 
4 residues into the correct orientation for proper disulfide bond formation. 
This facilitation of the formation of selected disulfide bonds then results in a 
higher yield of correctly folded molecules in both yeast and mammalian 
cells. 

8 

* 

Example 1 : Expression of functional TNFrED on the surface of S. cerevisae 

12 

The extracellular domain of the p55 TNF receptor (TNFrED) was fused to 
the C-terminal portion of a-agglutinin and expressed in the S cerevisae 
strain BJ2168 (a, prclAOl, prblA 122, pep4-3, leu2, trpl, urai-52) using 

16 the pYES2 vector. The C-terminal portion of a-agglutinin is tightly 
anchored in the cell wall and serves as a scaffold to present TNFrED on the 
cell surface. To facilitate the transport of the TNFrED-agglutinin fusion 
protein into the secretory pathway, the signal sequence of TNF receptor was 

20 replaced with the yeast invertase signal sequence. The TNFrED-agglutinin 
fusion gene was under the regulation of the inducible GAL1 promoter. 
Switching the carbon source of the yeast culture from glucose to galactose 
resulted in induction of the GAL1 promoter and expression of TNRrED- 

24 agglutinin. In a flow cytometric analysis with polyclonal antibodies directed 
against TNFrED we found that in an induced culture approximately 70% of 
the yeast expressed TNFrED-agglutinin on the cell surface (Fig. 1A). To 
determine whether TNFrED on the yeast cell surface was folded correctly 
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and could bind TNF-a we performed a flow cytometric analysis using 
biotinylated TNF-a as a probe and FITC-labeled avidin as the detection 
reagent. Yeast expressing the TNFrED-agglutinin fusion gene bound more 
4 biotinylated TNF-a than did yeast containing the pYES2 vector (Fig. IB). 
Adding excess unlabeled TNF-a (data not shown) reversed the shift in the 
histogram seen with the yeast expressing the TNFrED-agglutinin fusion 
gene. 

8 

Example 2: Selection of mutant TNFrED clones with enhanced biotinylated- 
TNF-ot binding. 

12 A modification of the mutagenesis approach previously described 

(Hermes,J.D., Parekh,S.M, Blacklow,S.C, Koster,H. & Knowles,J.R. A 
reliable method for random mutagenesis: the generation of mutant libraries 
using spiked oligodeoxyribonucleotide primers. Gene 84, 143-151 (1989)) 

16 was used to generate random mutant libraries. In this approach mutant 
oligonucleotides are produced by spiking a predetermined level of the 
"wrong" nucleotides at each position. The level of contamination of the 
wrong nucleotides was adjusted to generate either an average of two or three 

20 random point mutations per oligonucleotide. Each mutant library covered 
between 40 to 105 base pairs of TNFrED. Ten random clones from each 
library were sequenced. The type and the position of mutations were 
random and the regions were found to contain the anticipated average 

24 number of mutations. Approximately 40-50% of the clones contained either 
a small deletion or insertion, which results in the expression of a truncated 
TNFrED. Presumably these deletions or insertions resulted from errors 
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incorporated during the oligonucleotide synthesis process. The size of each 
library was between 0.5 x 10 6 and 10 x 10 6 independent mutant clones. Each 
library was transformed into the strain BJ2168 and approximately 1 x 10 6 
independent transformants were selected for binding to both biotinylated- 
TNF-a and polyclonal antibodies directed against the TNFrED. The- 
window of the two-dimensional fluorescence histogram that was chosen to 
select for the subpopulation of yeast expressing active TNFrED is shown in 
Fig. 1C. The selected subpopulation of yeast was grown and reselected with 
two-color sorting. After several rounds of two-color sorting the population 
of yeast in the selected window was enriched (Fig. 1C). Following three to 
four rounds of cell sorting, individual clones were analyzed by examining 
binding of biotinylated-TNF-a to the TNFrED on the cell surface. A 
majority of the clones analyzed from each sorted library appeared to bind 
higher levels of biotinylated-TNF-a. The mutant TNFrED plasmid was 
recovered from each yeast clone and re-transformed into the BJ2 168 strain 
and then analyzed by flow cytometry for binding of biotinylated-TNFa. The 
vast majority of yeast clones were false positives as the recovered plasmids 
did not confer higher levels of biotinylated-TNF-a binding. When false 
positive yeast clones were analyzed in the absence of biotinylated TNF-a 
and avidin-FITC, they were found to be shifted as compared to the parental 
BJ2168 strain. These false positive yeast clones are approximately 30% 
larger in size compared to the parental BJ2168, which is presumably what 
gives rise to the shift in the baseline absorbance. However, two mutant 
clones, 6 and 1 1 , from different libraries were identified that when 

* 

transformed into the BJ2138 strain conferred higher levels of binding of 
biotinylated TNF and therefore are true positives (Fig. 1 D). 



Applied Research Systems ARS Holding N.V 
WO 495 



WO 03/046160 



PCT/EP02/13059 



20 



Example 3: Characterization of mutant clones 6 and 1 1. 

4 The TNFrED coding region in mutant clones 6 and 1 1 was sequenced and, 
as anticipated, the mutations for each clone were only found in the sequence 
region mutated for that specific library. In mutant clone 1 1 there was a point 
mutation that results in Ser to Pro change at position 87 and in mutant clone 

8 6 there were two point mutations that result in an His to Pro change at 
position 34 and a Ser to lie change at position 57. 

To determine whether the mutations increase binding of biotinylated TNF 
12 through increased expression of TNFrED or by increasing the affinity of 
TNFrED for TNF, we performed a saturation binding experiment on yeast 
expressing either mutant clone 6 or 1 1 . Analysis of saturation binding 
experiments (Fig. 2) indicated that yeast expressing either mutant clone 6 or 
16 11 express higher levels of functional TNFrED than wild-type TNFrED and 
that the presence of these mutations does not affect the affinity of TNFrED 
for TNF-a. The approximate number of receptors/cell for yeast expressing 
mutant clone 6, mutant clone 11 and wild-type TNFrED was 3930, 1490, 
20 740, respectively. 

Example 4: Expression of mutated TNFrEDs in mammalian cells. 

24 Next, it was determined whether characteristics of the mutant clones derived 
in yeast are similar in mammalian cells. Toward this goal we constructed a 
mammalian expression vector containing the human growth hormone signal 
sequence fused at the amino terminus to the sequence encoding mature 
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TNFrED. We have previously found that the human growth homione signal 
sequence is more efficient in inducing the secretion of TNFrED than the 
native signal sequence. Site-directed mutagenesis was used to incorporate 
each mutation found in mutant clones 6 and 11, either individually or in 
various combinations. The TNFrED mutants were transiently expressed in 
HEK293-EBNA cells and the amount of secreted TNRrED was quantitated 
with an ELISA specific for TNFrED. The results (Table 2) indicated that 
either mutation H34P or S87P increased the expression level of TNFrED. 
Moreover, the relative increase in expression is similar to what is seen when 
these mutants are expressed in yeast (Fig. 2). The S57I mutation alone did 
not alter the expression level of TNFrED (Table 2). The effects of these 
mutations do not appear to be additive as the presence of both H34P and 
S87P on the same construct did not increase the level of TNFrED secreted in 
comparison to that found with H34P alone. 

The kinetics of TNF binding to the various mutants of TNFrED by surface 
plasmon resonance in a BIAcore instrument was also measured. The results 
in Table 2 indicate no difference between wild type and mutant TNFrEDs 
tested with respect to TNF-a binding. As indicated in Experimental 
Protocol, we use mild regeneration conditions and typically collect 30-40 
cycles of data on every surface. We averaged results obtained on three 
different surfaces to obtain the means (and 95% confidence intervals) of 
parameters for the control. The principal error introduced by this manner of 
data acquisition is an increase in the variance of fitted parameters for 
controls when data are pooled across different surfaces and different runs 
(see result for purified TNFrED in Table 3). Nevertheless, the data are good 
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enough to detect a 15% change in the ko n , a 33% change in koff , or both. 

This is evidenced in the results we show for negative control and HBS 
(Hepes buffered saline) data collected after 27 and 36 cycles were acquired 
on the respective surfaces. This was done to illustrate the worst case 
scenario since no TNF-a binding could be measured when these "ligands" 
(negative control, conditioned medium, and HBS) were tested at the 
beginning of a run. The residue from a 36 cycle buildup of various partially 
denatured TNF receptor proteins demonstrates strikingly different kinetics 
from intact control, wild type or any mutated TNF receptor. Consistent with 
our findings from the yeast expression studies, the presence of H34P, S87P 
or S57I mutations does not alter the affinity of TNFrED for TNF-a. 

Table 2. Transient Transfection of HEK293-EBNA cells with wild-type 
TNFrED and mutants of TNFrED. 



DNA 


ng/ml of 


Fold over wild- 




TNFrED 


type 


wild-type 


239 + 5.0 




H34P 


724 + 40 


3 


S57I 


190 ±19 


0.8 


S87P 


361 ±21 


1.5 


,H34P, S57I 


614 + 54 


2.6 


H34P, S87P 


71 1 ± 33 


3 


H34P, S57I, 


704 ± 26 


2.9 


S87P 







HEK293-EBNA cells were transiently transfected with plasmid DNA. 48 h 
post-transfection the conditioned medium was collected and the amount of 
TNFrED was quantitated with an ELISA. The amount of TNFrED is the 
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mean of four dishes + the standard deviation. This experiment was repeated 
three times and similar results were obtained. A single representative 
experiment is shown. 

Table 3. Binding Kinetics and TNF Affinity of TNF-rED mutants. 



Sample 


x 10 6 
M's" 1 


korr 

x 10 3 s 1 


x 10" 9 M 


RU 


x 2 


wild-type 
TNFrED 


2.0 


0.8 


0.4 


88.9 


7.2 


H34P 


1.9 


0.9 


0.5 


88.5 


7.6 


S57I 


1.9 


0.9 


0.5 


85.4 


7.0 


S87P 


2.0 


1.1 


0.6 


90.3 


4.7 


H34P + S57I 


1.9 


1.2 


0.6 


84.2 


3.9 


H34P + S87P 


1.9 


1.2 


0.6 


86 


4.9 


H34P + S87P + 
S57I 


1.8 


1.0 


0.6 


82 


4.7 


negative control 


1.3 


2.0 


1.6 


25.1 


0.9 


HBS 


1.3 


1.9 


1.5 


24.2 


0.8 


purified TNFrED 
Avg ± 95% C.I. 


2.0 ±0.3 


0.9 ±0.3 


0.5 ±0.1 


77.5 ± 3 





Each TNFrED and control sample listed was analyzed with seven different 
concentrations of TNF-ot (see Experimental Protocol). For a seven level 
data set ko n , kofr and were fit globally using a model which incorporates 
baseline drift. The negative control (conditioned medium) and the HBS 
control show K<j values of ~1.5nM due to the mild regeneration conditions 
used (see Experimental Protocol). Under these conditions, we routinely note 
analyte-independent 0.5-1.0 RU/cycle upward movement in the absolute 
value of Rpoint baseline (data not shown). We suspected this result was due 
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to small amounts of partially denatured TNFrED and mutant TNFrED 
remaining after each regeneration treatment and checked to see that it did not 
depend upon the nature of the mutant (data not shown). Analysed prior to a 
27-36 cycle run of data collection, negative controls showed no TNF-a 
binding. 

Example 5; Examination of the locations of substituted residues in crystal 
structures of TNFrED 

The proline substitutions in mutant clone 6 and 1 1 are adjacent to cysteine 
residues (cysteines 33 and 88) involved in disulfide bonds. To analyze 
whether there is a potential structural role for the mutant residues, we 
tabulated the conformational preferences of proline, serine, histidine and 
isoleucine residues in the crystal structures of TNFrED (pdb codes: lext, 
lncf and ltnr) and in general protein structures. Since the conformation of 
polypeptides is controlled by the <J>-vj/ angles of the residues, we plotted the 
<|>-\y distribution of those residues in the dataset of high-resolution and non- 
homologous protein structures. As shown in Figure 3, the <J>-vj/ distribution 
of a proline residue is more restricted than that of the other three residues, 
and the <Jm|/. distribution of a serine residue is greater than that of an 
isoleucine residue. Specifically, the size ratio of the distributions for serine 
to proline is 2.5, for histidine to proline is 1.8, and for serine to isoleucine is 
1.4. Moreover, the <J>-\|/. angles of S87 and of H34 in TNFrED crystal 
structures (red filled marks) preferentially overlap the favored region of 
proline's <j)-vj/ plot as compared to the favored region of <j>-\}/ plots for serine 
or histidine. This preference is slightly more noticeable in the crystal 
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structures of TNFrED itself (pdb code: lext; resolution 1.85 A [red squares], 
or Incf; resolution 2.85 A [red diamonds]) than in the crystal structure of the 
TNF/TNFrED complex (pdb code: ltnr; resolution 2.85 A [red triangles]). 
4 In contrast, the <J>-vjy angles of Ser57 in TNFrED crystal structures are located 
in a slightly more favored region in the serine <|>-vj/ plot compared to that 
found with isoleucine, although the strongly favored region of either residue 
does not significantly overlap the distribution taken from crystal structures. 

8 

To assess the effect of adjacent residues in the choice of <Jmj/ preferences by 
the substituted residue, we analyzed the <|>-\}/. angles in the pdb database 
representative set containing the same tri- (green 'x's) or tetra- (green 

12 circles) peptide fragments as those in clone 6, clone 11 or wild-type 
TNFrED. As shown in Figure 3 and listed in Table 4, there are more cases 
of proline residues in which the <J>-\|/ angles are similar to those of H34 or 
S87 in the TNFrED crystal structures than there are of histidine or serine 

16 residues with <|>-\j/. angles similar to those of H34 or S87. The data are 
consistent with the <j>-\|/ angles of S57 in the TNFrED crystal structures being 
slightly more favored for a serine residue than for an isoleucine. 

20 



24 
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Table 4. List of crystal structures containing the same tri- or tetra- amino 
acid sequence as those in the wild-type TNFrED or mutant clones. 



Resid 
ue 


Surrounding 
Sequence 


Pdb code containing the 
sequence 


Similar/ 
distinct 
ratio 

• 

■ * 


Distinct <b-vi/ 

(a) 


Similar <|>-i|/ (a) 


Ser87 
in 

wild- 
type 


Ser-Ser-Cys 

(b) 


larb, lnxb, 
2pia, 3ebx, 
lbgc, lfrr 


ltml, 4cha 


2/6 


Ue-Ser-Ser- 
Cys or Ser- 
Ser-Cys-Thr 


None 


None 




Pro87 
in 

clone 
6 


Ser-Pro-Cys 


2tgi 


2ctc, ltfg 


2/1 


Ile-Ser-Pro- 
Cys or Ser- 
Pro-Cys-Thr 


None 


None 


• 


His34 
in 

wild- 
type 


Cys-His-Lys 


3cy3 


None 


0/1 


Lys-Cys-His- 
Lys or Cys- 
His-Lys-Gly 


None 


None 




Pro34 
in 

clone 
11 


Cys-Pro-Lys 


ltgx(2 
locations) 


ltgs, 3sic, 8can 


3/2 


Lys-Cys-Pro- 
Lys 


None 


8can 


1/0 


Cys-Pro-Lys- 
Gly 


None 


ltgx 


1/0 


Ser57 
in 

wild- 
type 


Glu-Ser-Gly 


lfha (2 
locations), 
Imam, 2had, 
2tmd, 3gbp 


lads, Ibtc, 
lfba, lhil, 
llga, 3il8, 
lnar, lptx, 
lttb, 4fxn, 
1 ovb, 2pia 


12/6 


Glu-Ser-Gly- 
Ser 


lovb 


None 


0/1 

• 


Cys-Glu-Ser- 
Gly 


2pia 


None 


0/1 


Ile57 
in 


GluJle-Gly 


2mnr, 2tmd, 
8 can 


lbtl, 2ak3, 
lppb, lxim 


4/3 
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clone 


Cys-Glu-77e- 


None 


None 




11 


Gly or Glu- 










Ile-Gly-Ser 









(a) . The criteria for similar <j>-v|/ angles is that the <j>-\|/ angle of the subject 
residue is within 15 degree of the <J>-\(/ angles found in the corresponding 

4 residue of any of the TNFrED crystal structures. 

(b) . The residues in bold are either the substituted residue in the mutant 
clone or the residue found in wild-type TNFrED. 



8 Experimental Protocols Used in the Examples 

Plasmids. 

The TNFrED-agglutinin fusion was constructed by linking the signal 
sequence from the invertase gene to the hTNFrED sequence encoding 

12 residues 12 to 172 which was then fused to the C-terminal portion of the <x- 
agglutinin gene encoding residues 330 to 650 (1, 27). Between TNFrED and 
the a-agglutinin gene was a sequence encoding the flexible linker 
GQPAAAPA. This linker is similar to the sequence of hinges between 

16 domains in immunoglobulins . The TNFrED-agglutinin fusion gene was 
subcloned into the pYES2 vector (Invitrogen Corp.) to generate pYES2- 
TNFrED-Agg for expression in yeast. 

20 The hTNFrED coding sequence from residue 12 to residue 172 was 
subcloned into pCMV, a mammalian expression vector, to generate pCMV- 
TNFrED. In pCMV-TNRrED the human growth hormone signal sequence 
is fused upstream of the hTNFrED sequence. The expression of hTNFrED 

24 in pCMV is regulated by the human cytomegalovirus (CMV) immediate- 
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early enhancer/promoter region. Downstream of hTNFrED is the human 
growth hormone polyadenylation signal. pCMV-TNFrED containing either 
the mutation S57I or S87P was generated by subcloning the appropriate 
region from mutant clones 6 and 11, respectively. pCMV-TNFrED 
containing the H34P mutation was generated using the whole vector PCR 
technique (6). Combinations of the mutations of TNFrED (H34P+S57I, 
H34P+S87P, H34P+S57I+S87P) was accomplished by subcloning the 
appropriate regions. The sequence of the coding region of all constructs was 
verified with the Thermo Sequenase radiolabeled terminator cycle 
sequencing kit (Amersham Pharmacia Biotech). 

Expression of TNFrED on the surface of yeast cells. 
Saccharomyces cerevisiae strain BJ2168 (a,prc7-407,prW-l 122, pep4-3 9 
leu2, irpl, ura3-52\ Yeast Genetic Stock Center, Berkeley, CA) was 
transformed with either pYES2 or pYES2-TNFrED-Agg using the lithium 
acetate method previously described (Gietz,R.D., Schiestl,R.H., 
Willems,A.R. & Woods,R. A. Studies on the transformation of intact yeast 
cells by the LiAc/SS-DNA/PEG procedure. Yeast. 11, 355-360, 1995). 
Transformed yeast cells were grown overnight in Ura" medium 
supplemented with 2% glucose at 30°C with shaking. Expression was 
induced by growing the transformed yeast overnight at 30°C with shaking in 
Ura" medium containing 2% galactose and l%raffinose. Cells were 
harvested by centrifuging at 16,060 x g for 2 min, washing twice with 
Dulbecco's phosphate-buffered saline (PBS) (Gibco/BRL) and diluting the 
cells to 4 x 10 6 cells/ml. 4 x 10 5 cells (100 ul) were incubated with either 
biotinylated hTNF-a (50 nM or 10 nM) or goat anti-human sTNF RI 
antibodies (0.7 ug/ml, R&D Systems) or both for 1 h at room temperature in 
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a final volume of 140 ul hTNF-cx (Protein Purification group, Serono 
Reproductive Biology Institute) was biotinylated with the EZ-link Sulfo- 
NHS-LC-Biotinylation kit (Pierce Corp.). Following the incubation, cells 

4 were centrifuged at 16,060 xg for 2 min and re-suspended in 140 ul of ice- 
cold PBS containing 0. 1% bovine serum albumin (BSA). FITC-labeted 
avidin (2.2 ug/ml, Jackson ImmunoResearch) or R-Phycoerythrin- 
conjugated donkey anti-goat IgG (2.2 ug/ml, Jackson ImmunoResearch) or 

8 both were added to cells in a total volume of 180 ul and incubated at 4°C for 
45 min. Cells were centrifuged at 16,060 x g for 2 min, washed once with 
ice-cold IX RDF1 buffer (R&D Systems), re-suspended in -400 ul of IX 
RDF1 buffer and analyzed on Becton Dickinson FACSort. The event rate 

12 was set at approximately 150 cells/sec and a total of 10,000 cells were 
collected per analysis. The yeast population was gated according to light 
scatter (size) to avoid analysis of clumped cells. 



16 Production and selection of random mutant libraries: 

Five unique restriction endonuclease recognition sites were introduced into 
the coding region of TNFrED by silent mutagenesis using the GeneEditor in 
vitro Site-Directed Mutagenesis System (Promega Corp.). This step was 

20 completed in order to divide the TNFrED into 6 regions of between 40 to 
105 bp. Five of the six regions were separately subjected to a modification 
of a random mutagenesis method previously described. Briefly, long 
oligonucleotides (Midland Certified Reagent Company) spanning a region of 

24 TNFrED flanked by unique restriction endonuclease recognition sites were 
generated that contain a predetermined amount of the three "wrong" 
phosphoramidites at each position. The amount of spiked wrong 
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phosphoramidites was adjusted to generate an average of either two or three 
mutations per oligonucleotide. Primers flanking the mutated, long 
oligonucleotide were used in a polymerase-chain reaction to amplify the 

4 DNA into cassettes for each region of TNFrED. Randomly mutagenized 
DNA regions were digested with the appropriate restriction endonucleases 
and ligated into the pYES2-TNFrED-Agg construct. Ligation reactions for 
each mutant library were transformed into XLIO-Gold ultracompetent cells 

8 (Stratagene Corp.) using a ratio of 1 ul of ligation mixture per 70 ul of 
competent cells following the manufacturer's protocol. Following the 
transformation, 20 transformation mixes from each mutant library were 
pooled and grown overnight at 37°C in 500 ml of NZY medium containing 

12 50 ug/ml of ampicillin. Ten random clones from each mutant library were 
sequenced using the Thermo Sequenase radiolabeled terminator cycle 
sequencing kit (Amersham Pharmacia Biotech). Approximately 5.0 ug of 
DNA from each random library was transformed into ten aliquots (1 x 10 9 

16 cells/aliquot) of BJ2168 cells using the lithium acetate transformation 
method. Cells were grown for 24-30 h at 30°C with shaking. Then 
approximately 1 x 10 8 cells were grown overnight at 30°C with Ura* medium 
containing 2% galactose and 1% raffinose for the induction of expression. 

20 For each FACS experiment, 4 x 10 6 cells were labeled as described above 
using biotinylated hTNF-a at a final concentration of 50 nM for the mutant 
library containing mutant clone 6 or 10 nM for the other mutant libraries. 
Goat anti-human sTNF Rl antibodies were added along with FITC-avidin 

24 and R-phycoerythrin-conjugated anti-goat IgG. A total of 1.2 x 10 7 cells 
was sorted for each library. FACS was completed on a Becton Dickinson 
FACsort with an event rate of <2000 cells/sec. The first round of sorting 
was performed in exclusion mode and subsequent sorting was completed in 
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single cell mode. Collected cells were seeded into selection medium with 
glucose and 1/100 volume was plated for calculating actual number of cells 
collected. Selected cells were re-grown at 30°C and then induced in 
selection medium with galactose and raffinose for the next round of sorting. 
Each library was sorted a total of three to four times. Approximately 0.08%- 
0.4% of cells were collected in the first round, and 0.01%-0.2% in the 
subsequent rounds. The collected cells from the last round of sorting were 
plated Ura" plates to yield single colonies. 

Recovery and analysis of mutant TNFrED clones: 

Approximately 50 individual yeast clones from each library sort were 
analyzed by flow cytometry. 100 ul of induced cells at 0.15 OD 6 oonm were 
incubated with either 50 nM or lOnM biotinylated TNF-a in a total volume 
of 140 ul, under same conditions as described above. Binding of TNF-a was 
detected with FITC-avidin (2.2ug/ml), and cells were analyzed by flow 
cytometry on the Becton Dickinson FACsort. Clones having a greater 
median fluorescence than yeast expressing pYES2hTNFrED-Agg control 
were chosen for rescue of plasmid. DNA plasmids were recovered from 
yeast, and transformed into competent E. coli JM109 (Promega Corp.) 
following the manufacturer's protocol. Purified plasmid DNA was re- 
transformed into BJ2168 yeast cells as described above and individual 
clones were re-analyzed using methods described above. The TNFrED 
regions of positive clones after the re-transformation were sequenced using 
the Thermo Sequenase radiolabeled terminator cycle sequencing kit 
(Amersham Pharmacia Biotech). 
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■ 

TNF-QiBinding assay 

Yeast expressing either the wild-type or mutant TNFrED-agglutinin fusion 
or yeast containing the control vector pYES2 were resuspended in PBS 
/BSA (10 mg/ml) at a concentration of 1 x 10 8 cells/ml. In each well of a 
Durapore 96-multi-well plate (Millipore Corp.), 50 ul of the cell suspension 
were incubated with 50 ul of PBS/BSA (10 mg/ml) containing various 
concentrations of 125 I-TNF-a (Amersham Pharmacia Biotech) for 2 h at 
room temperature. Following the incubation, the wells were washed three 
times with ice-cold PBS using the MultiScreen filtration system (Millipore 
Corp.). Non-specific binding was determined with the yeast containing the 
control vector pYES2 and the non-specific binding was <10% of the total 
counts. The Kd and Bmax was determined using the GraphPad Prism 
program. 

Expression of TNFrED mutants. 

The pCMVhTNFrED constructs were transiently transfected into HEK293- 
EBNA cells using a calcium phosphate method. Transfections were 
completed in triplicate and hTNFrED expression was quantitated from 
medium harvested 48 h post-transfection using an ELISA for hTNF R-l (R 
& D Systems). 

* 

BIAcore Analysis of TNFrED mutants. 

Surfaces displaying polyclonal goat anti-human TNFrED were constructed 
by binding biotinylated antibody (BAF225 from R&D Systems) to 
streptavidin-coated Sensor SA chips (P/N BR- 1000-32, BIAcore Inc.). 
Under the conditions of these experiments the streptavidin-biotinylated anti- 
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TNFrED interaction behaves as if irreversible. Data for the comparison of 
TNFrED/TNF-a interaction between mutants and wild-type TNFrED were 
collected in HBS (lOmM HEPES pH 7.4, 150mM NaCl, 3.4mM EDTA, 
4 0.005% P20) as follows. 

Conditioned media from transient transfections of HEK293-EBNA cells 
were concentrated with Centricon 10s and the amount of TNFrED protein 

8 was quantitated with the R & D Systems ELISA for hTNF R-l. Buffer- 
diluted purified TNFrED (BS03-99, obtained from IRCS) or buffer-diluted 
conditioned medium containing a mutated protein was injected onto a 
BAF225 surface at 50nM, resulting in the formation of antibody-TNFrED 

12 complex. TNF-a at 1 nM (trimer in HBS) was then injected, and kinetics of 
the binding and dissociation were recorded via the time course of the surface 
plasmon resonance response. The BAF-225 surface was regenerated with 
50% 100 mM sodium citrate pH 2.5/50% 100 mM sodium citrate pH 3., 

16 which stripped off TNFrED and TNF-a. This series of injections was 
repeated for TNF-a at 2, 5, 10, 20, 50, and 100 nM while holding the 
concentration of TNFrED injected (wild-type or mutant) at 50 nM. Each set 
of binding curves was fit globally using a model for 1:1 interaction that 

20 includes a term for linear baseline drift. Conditioned medium (no TNFrED) 
and HBS were used as negative controls. The BAF225 surface was 
regenerated with 50% 100 mM sodium citrate pH 2.5/50% 100 mM sodium 
citrate pH 3., which stripped off TNFrED and TNF-a. In our hands it has 

24 been necessary to have fairly mild regeneration conditions to achieve a data 
collection rate sufficiently high to make this technology useful. 
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Analysis of crystal structures. 

A dataset previously described (Wang,Y., Huq,H.L, de,l.C, X & Lee,B. A 

4 new procedure for constructing peptides into a given Calpha chain. Fold 
Des. 3; 1-10 , 1 998) was used to analyze the <|>-\|/ angle distributions of 
proline, serine, histidine and isoleucine residues in high-resolution and non- 
homologous protein crystal structures. This dataset contains 136 x-ray 

8 structures with a resolution of 1 .8 angstroms or higher, and with the 
sequence identity of less than 25% between any pair in the set There are 
803 proline, 1251 serine, 409 histidine and 959 isoleucine residues in the 
dataset. The <|>-\|/ angle space is equally divided into 36x36 bins with an 

12 interval of 10 degree from -1 80 degree to 1 80 degree for <|> or \j/ angle. The 
darkness of a bin in the <j>-\j/ angle distribution diagrams (Figure 3) is 
associated with the frequency of binomial distribution of the amino acid 
residue in the dataset. The higher the frequency, the darker the bin. The 

16 lightest gray bins represent a dimensionless ratio of Z value (fold of 

probable error over the basal probability) of 1 . The Z value of the darkest 
(black) bins is 40 or above, and that of other grey bins is 3, 5, 10 and 30, 
respectively. The basal probability is Dp, where n is the total number of the 

20 amino acid residue in the dataset and the p is 1/(36x36), and the error 
variance q is : r-r. — r 



Both the above dataset and the Hobohm's dataset (Hobohm,U. & Sander,C. 

■ 

24 Enlarged representative set of protein structures. Protein ScL 3, 522-524 , 
1994) were used to search for a tri- or a tetra-amino acid sequence 
corresponding to the same sequence surrounding either the mutated residues 
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or the wild-type sequence. The reason to include the Hobohm's dataset for 
the search is to increase the number of protein structures containing the 
sequence to increase the statistical relevance. There are 168 protein 
structures in the Hobohm dataset, and 35 structures are in overlap with the 
first dataset. The combined dataset contains 269 non-redundant protein 
structures, which represents the different type of folds and sequences of the 
whole PDB database. The pdb codes of the three crystal structures 
containing TNFrED molecules are lext, lncf and Itnr . These structures are 
not in any of the above datasets. 

The invention has been described in detail with reference to preferred 
embodiments thereof. However, it will be appreciated that those skilled in 
the art, upon consideration of this disclosure, may make modifications and 
improvements within the spirit and scope of the invention. 
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CLAIMS 



1 . A method of increasing expression of a protein by an expression 
system wherein the expression system comprises a polynucleotide 
sequence of codons encoding the protein wherein at least one codon of 
the polynucleotide is substituted for the codon encoding proline. 

2. A method of claim 1, wherein the substituted nucleic acid codon 
encoding proline is positioned within 5 nucleic acid codons from a 
nucleic acid codon encoding the amino acid cysteine. 

3. A method of claim 1, wherein the substituted nucleic acid codon 
encoding proline is positioned within 10 nucleic acid codons from a 
nucleic acid codon encoding the amino acid cysteine. 

4. A method of claim 1, wherein the substituted nucleic acid codon 
encoding proline is positioned within 15 nucleic acid codons from a 
nucleic acid codon encoding the amino acid cysteine. 

5. A polypeptide comprising an amino acid sequence wherein at least 
one amino acid in the sequence is substituted for the amino acid 
proline. 

6. The polypeptide of claim 1 wherein the substituted amino acid is 
positioned within 5 amino acids of a cysteine residue contained in the 
polypeptide's amino acid sequence. 

7. The polypeptide of claim 1 wherein the substituted amino acid is 
positioned within 10 amino acids of a cysteine residue contained in 
the polypeptide's amino acid sequence. 

8. The polypeptide of claim 1 wherein the substituted amino acid is 
positioned within 15 amino acids of a cysteine residue contained in 
the polypeptide's amino acid sequence. 
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9. The protein of claim 1 wherein the protein is Tissue Necrosis Factor 
Binding Protein. 

10. The protein of claim 1 wherein the protein is a member of the 
Transforming Growth Factor family of proteins. 

1 1 .A polynucleotide comprising a nucleotide sequence of codons^ 
encoding amino acids wherein at least one codon is substituted for the 
codon encoding proline. 

12. A polynucleotide of claim 7 wherein the substituted codon is 
positioned within 5 codons of a codon encoding cysteine. 

13. A polynucleotide of claim 7 wherein the substituted codon is 
positioned within 10 codons of a codon encoding cysteine. 

14. A polynucleotide of claim 7 wherein the substituted codon is 
positioned within 15 codons of a codon encoding cysteine. 
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FIGURE 1(A) 
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FIGURE 1(B) 
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FIGURE 1(C) 
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FIGURE 1(D) 
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FIGURE 2(A) 
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FIGURE 2(B) 
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FIGURE 2(C) 
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Figure 4 

TNFrED sequence 



GATAGTGTGTGTCCCCAAGGAAhATATATCCATCCT 

1 - + ♦ 4 4 4 60 

CTATCACACAC&GGGGTTCCTTTTATATAGGT 

AspSerValCysProGlnGlyLysTyrlleHisProGlnAsnAsnSerlleCysCysThr 
DSVCPOGKYIHPQNNSICCT 



AAGTGCCACAAAGGAACCTACTIXSTACAATGACTGTCCAGGC^ 

+ + + + + + 120 

TTCACGGTGTTTCCTTGGATGAACATGTTACTGA 

LysCysHisLysGlyThrTyrLeuiyrAsnAspCysProGlyProGlyGlnAspThrAsp 
KCHKGTY LYNDCPGPGQDTD 



TG CAGGG AGTGTGAGAGCXSG a TCCTTCAC t G C t TCAGAAAACCACCTCAGACACTG CCTC 

121 t- + 4 + + 4 4 180 

ACGTCCCTCACACTCTCGCC t AGGAAGTGaCGaAGTCTTTTGGTGGAGTCTGTGACGGAG 
CysArgGluCysGluSerGlySerPheThrAlaSerGluAsnHisLeuAxgHisCy&Leu 
CRECBSGSFTAS ENHLRHCL 



AGCTGCTCXIAAATGCCGAAAGGAAATGGGTCAGGrrGGAGATCTC^ 

181 4 4 4 4 -f 4 240 

TCGACX^GGTTTACGG CTTTCCTTT A CCCAGTCCACCTCTAGAGAAGAACGTGTCAg CTG 
SerCysSerLysCysArgLyGGluMetGlyGlnValGluIleSerSerCysThrValAsp 
SCSKCRKEMGQVEISSCTVD 



CGGGACACCGTGTGTGGCTGCAGGAAGAACCAGTACCGGCATTAITGGAGTGAAA^ 

241 4 4 4 4 4 4 300 

GCCCTGTGGCAC^CACCGACGTC CnxrriX^ rrCATGGCCQT 

ArgAspThrValCysGlyCyeArgLysAsnGlnTyrArgHiBTyrTrpSerGluAsnLeu 
RDTVCGCRKNQYRHYWSBNL 



TTCCAGTGCTTCAATTGCAGCCTCrcCCTCAATC 

301 4 4 4--- 4 4 4 360 

AAGGTCACGAAGTTAACGTCGG AGACXX5AGTTACCCrrc 

PheGlnCysPheAsnCysSerLeuCysLeuAsnGlyThrValHisLeuSerCysGlnGlu 
FQCFNCSLCLNGTVHLSCQE 



AAACAGAACACCGTCTGCACCTCCCATGCAGGTTTCTT^ 

361 4 4 4 4 4 4 420 

TTTGTCTTGTGGCACACGTGGACGGTACGTCCAAAGAAAGA 

J*ysGlnAsnThrValC^sThrCysHisAlaGlyPhePheLeuArgGluAsnGluCysVal 
JCQN*. TVCTCHAG F P LR ENECV 
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Figure 4 Continued 

TCCTGTAGTAACTGTAAGAAAAGCCTGGAGTGCACGAAGTTGTGCCTACCCCAGATTGAG 

421 + + + + + -- --+ 480 

AGGACATCATTGACATTCTTTTCX5GACCTCACGTGCTTC 

SerCysSerAsnCysLysLysSerLeuGluCysThrLysLeuCysLeuProGlnlleGlu 
SCSNCKKSLECTKLCLPQIE 



AATTGA 

481 --r 486 

TTAACT 
AsnEnd 
N * 



